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0\ ' Abstract 



Quantum parallelism is the main feature of quantum computation. 
In 1985 D. Deutsch showed that a single quantum computation may be 
sufficient to state whether a two-valued function of a two-valued variable 
is constant or not. Though the generalized problem with unconstrained 
C ■ domain and range size admits no deterministic quantum solution, a fully 

probabilistic quantum algorithm is presented in which quantum paral- 
lelism is harnessed to achieve a quicker exploration of the domain with 
respect to the classical "sampling" strategy. 



> 

o 

<N 

^ ■ 1 Introduction 

O 

f* 1 **. ■ A quantum computer is a quantum system whose time evolution can be thought 

0^ ' of as a computation, much in the same way as we think of the time evolution 

(— 1 . of a pocket calculator to be a computation. 

Q_i' For our pourposes it will suffice to model the quantum system as a "black 

box" and focus our attention on two discrete observables out of a complete 
£h \ set, which we shall call the input and output register. Following the standard 

notation £Q , we shall indicate the computation of a function / : A — > B as 

|x> ® |0> ~ |x> ® |/(x)>, 

the first ket describing the state of the input register and the second the state 
of the output register. Kets are labelled according to the elements of A and B 
& ' they represent. 

One of the most powerful features of quantum computation is quantum par- 
allelism. The superposition principle of quantum mechanics allows us to prepare 
the computer in a coherent superposition of a set I C A of input states. After 
a single run, all of the corresponding outputs /(x) appear in the final state, 
according to the time evolution 

5>)®|0)^5>)®|/(x)) (1) 

xei xei 



Unfortunately, this is no "pay one, take N". In fact, the result is an entangled 
state of the input and output registers and there is no single measurement 
allowing us to extract from it all the computed values of f(x) 0. However, 
it may well be possible to distil from this final state some global property of 
the function, thus exploiting quantum parallelism. One of the most famous 
examples was presented by D. Deutsch, who showed that a single quantum 
computation may suffice to state whether a two-valued function of a two-valued 
variable is constant or not [2 El- D. Deutsch and R. Jozsa later generalized this 
result |3] showing that the problem of classifying a given function 

/:{0,...,2JV-1}-{0,1} 

as "not constant" or "not balanced" can be solved in polynomial time by means 
of a quantum computer (the time required by a classical solution is exponen- 
tial). Also D. R. Simon showed that the problem of determining if a function is 
invariant under a so-called XOR mask, while it is classically intractable, admits 
an efficient quantum solution 0. 

All of the algorithms cited above (apart from the last, for which Simon also 
considered a fully probabilistic generalization) are characterized by a variable 
running time and zero error probability. They consist of a non-classical compu- 
tation like JJJ followed by a measurement of the final state of the computer, as 
a result of which either the correct answer is obtained or the relevant informa- 
tion is destroyed and an explicitly inconclusive result is returned. In the latter 
case one has to go through the whole procedure again, so that only an average 
running time for the algorithm can be estimated. 

Global properties of functions that can be determined by such an algorithm 
are said to be Computable by Quantum Parallelism (QPC).This definition was 
put forward by Jozsa [2] who also demonstrated that, at least in the case of 
two-valued functions, the QPC properties that can be determined by means 
of a single computation are an exponentially small fraction of all the possible 
global properties. 

In this paper we tackle the general problem of stating whether a function 

/:{0,...,iV-l}^{0,...,M-l} 

is constant or not. 

We show that for N > 2 or M > 2 this property is not QPC, meaning that 
any measurement following a computation like Q has a finite probability of 
yielding a wrong result. We therefore investigate the power of quantum paral- 
lelism in a fully probabilistic setting. Assuming that the (classical) computation 
of / on k randomly sampled points yields a constant value, we calculate the pos- 
terior probability that / is actually constant. We then compute the analogous 
probability for a quantum algorithm requiring the same number of computa- 
tions of /. Comparison of the two results shows that our quantum strategy 
allows making a better guess at the solution, its indications being more likely 
to be correct. 



2 Is a direct generalization of Deutsch's example 
possible? 

We shall now briefly recall the classical example put forward by D. Deutsch |jj], 
before confronting the problem of its generalization. Suppose we are given a 
function / : {0, 1} — ► {0, 1} and we are interested to know whether / is constant 
or not. Of course there are only four such functions (i.e. four instances of the 
problem), namely 

/i(0) = f/ 2 (0) = l f/ 8 (0) = f/ 4 (0) = l () 

/i(l) = \/a(l) = l l/ 3 (l) = l l/ 4 (l) = [Z) 

If all we can use is a classical computer, there is only one way to do the job: 
we must compute both /(0) and /(l) and compare them to check if they are 
equal. On the contrary, since in this simple case the property "/ is constant" is 
QPC, a quantum computer gives us a fair chance of finding the solution at the 
cost of the single computation 

-L(|o) + |i» ® |o> -> -^(|o)|/(o)) + |i)|/(i))). 

After the computation, the calculator halts with its input and output regis- 
ters in one of four possible states, corresponding to the four possible functions: 

I/O = ^(|o)|o> + |i>|o» \h) = ^(|o)|i) + |i)|i» 

|/ 3 ) = ^(|0>|0> + |1)|1» |/ 4 > = ^(|o)|i) + |1>|0» 

Since the above states are linearly dependent, they cannot be distinguished 
with certainty. This means that no measurement can establish which function 
was actually computed, or, which is the same, it's impossible to extract from the 
final state both the values of /(0) and /(l). However, we need only discriminate 
|/i) and I/2), the final states yielded by the constant functions, from I/3} and 
I/4). This can actually be done by measuring on the final state of the two 
registers an observable with the following non-degenerate eigenstates: 

|SAME) = i(|0) + |1»(|0) - |1» |DIFFERENT) = |(j0) - |1»(|0) - |1)) 

|FAIL) = |(j0) + |1»(|0) + |1)) |ERROR) = |(|0> - |1))(|0) + |1)) 

These four states can be thought of as "flags" indicating the result of the compu- 
tation and have been named according to their meaning. This becomes clearer 
as soon as we rewrite the final states Q on the basis of the above eigenvectors: 

I/O = 4=(|FAIL) + |SAME)) I/3) = -^(IFAIL) + |DIFFERENT)) 

|/ a ) = ^=(|FAIL) - |SAME)) |/ 4 ) = ^ (I FAIL) - |DIFFERENT)) 



It is now evident that 

1. projection along the eigenvector |SAME) can only take place if the state 
of the computer is either |/i) or I/2), i.e. when / is constant; 

2. likewise, projection along the eigenvector | DIFFERENT) can only occur 
if / is not a constant function; 

3. regardless of the final state |/j) of the two registers after the computation, 
the measurement can yield a |FAIL) state with probability 1/2; 

4. state I ERROR) is orthogonal to the four final states listed above, and 
therefore it should show up only as a consequence of noise-induced errors. 

Note that, according to the definition of the QPC class 0, the quantum 
algorithm can either give us the correct answer or no answer at all: as long as 
everything works properly, we'll never get a wrong result. This comes in handy 
when we are asked to solve a decision-theoretic problem in which simply waiting 
has a much higher utility than taking a wrong action. In this case we can discard 
the FAIL results and base our decisions upon the meaningful answers, which we 
know to be correct. 

The straightforward generalization of Deutsch's example would go as follows. 
Given a function 

/:{0,l,...,iV-l}-f{0,l,... ) M-l} (4) 

and assuming we can perform the non-classical computation 

N-l 

x— 

we are asked to devise an observable O on the joint state of the two registers 
such that, after a single measurement of O, we can either 

1. obtain a reliable indication that function / is constant; 

2. obtain an equally reliable indication that / is not constant, or finally 

3. get an explicitly inconclusive result. 

Let Ti be the Hilbert space of the joint states of the input and output regis- 
ters, li £ is the basis of Ti formed by the (non-degenerate) eigenstates of O, all 
that is needed would be the existence of two disjoint subsets C,C C £ such that 

i. all the final states obtained from the computation of constant (non-con- 
stant) functions have a non-zero projection along C (along C); 

ii. the final states corresponding to non-constant (constant) functions are 
orthogonal to C (to C). 




N-l 

= £|x>® |/(z)), (5) 



These two requirements are evidently fulfilled in the case of Deutsch's example, 
as can be easily seen by taking C = {|SAME)} and C = {| DIFFERENT)} (for 
further details see (2j)- 

However, as soon as the domain and range of the function / grow larger, 
requirements i. and ii. become incompatible. What happens is that whenever 
N > 2 or M > 2 the computation of constant functions yields final states that 
are linearly dependent upon those obtainable from non-constant functions. This 
clearly forbids the existence of C, since the final states coming from non-constant 
functions cannot be orthogonal to C. 

In other words, the global property "/ is constant" is no longer QPC in the 
general case. Note that, as demonstrated by Jozsa |2], this result is independent 
of the particular superposition used as the input state for the non-classical 
computation J5j- 

3 Probabilistic generalization 

The fact that the investigated property of / is not QPC compels us to work in a 
fully probabilistic setting in order to cope with the possibility of wrong results. 
Preserving the general structure of the algorithm as outlined at the beginning 
of the preceding section, we note that we can still devise an observable O' 
such that any "constant" ket has a large projection along a subset C of the 
eigenstates of 0'\ for example, we can arrange for Span(C') to be the very 
space spanned by the "constant" vectors. The problem is now that since "non- 
constant" kets generally have a non-zero projection along C, measuring O' 
no longer ensures a clear-cut distinction between constant and non-constant 
functions. However, since "non-constant" final states do have some component 
along the orthocomplement of C, measuring O' still gives some (probabilistic) 
information about the computed function /. We are left with two asymmetrical 
possibilities (actually, as we shall see, a more convienient choice for C also makes 
an explicitly inconclusive result possible): 

a. measuring 0' yields an eigenvalue associated to the orthocomplement of 
Span(C') in 7i. Since this can only happen if the computed function / is 
not constant, this is an exact solution to the problem. 

b. measuring O' projects the final state of the two registers onto a state 
in C . If the computed function were constant, this would be the only 
possibility; unfortunately, as seen above, other functions may also yield 
the same result. We have therefore obtained only a probabilistic indication 
about / being constant. 

It is now clear that the generalized algorithm is essentially similar to a classi- 
cal probabilistic algorithm, in that its results are not necessarily correct. Never- 
theless, as we shall see in the following sections, the posterior probability of the 
function actually being constant after a result of type b. is obtained turns out 
to be much larger for our quantum algorithm than for the classical "sampling" 
strategy (see section |2J. 



In the rest of this section we shall deal with the choice of the observable, 
which constitutes the core of the algorithm. 

3.1 Functions and matrices 

We would now like to introduce a correspondence between the Hilbert space 
TL of the two registers of the computer and the space M. (M, N) of complex 
matrices with M rows and N columns. 

Let B be the computational basis of TL, the first ket referring to the state of 
the input register and the second to that of the output register: 

B = {|0)i|0) o , |0)i|l) o , . . . , \N- l)i|M - 1) } 

We define the isomorphism ip : TL — > A4(M, N) by identifying ip (|n)j|m) ) with 
the M x N matrix whose elements are 

[ip(\n)i\m) )]ij = 5 m ,i8n,j- 

The isomorphism ip, which maps the elements of B onto the canonical basis of 
M.(M, N), is then extended by linearity to the whole TL. 

Since the final state of the computer after the computation of function / is 



N-l 

I/) = ~m E l")il/(»)>o. 



JV-l 

N ■ „ 

the entries of the corresponding matrix F = ip{\f)) turn out to be (F)„ ]Tn = 
5 m fi n )i so that F somehow resembles the graph of / drawn with the "x" axis 
along the rows and the u y" axis pointing down|J. 
It is easy to check that the scalar product 

F • G := Tr (F^G) (6) 

in M.(M, N) is preserved by ip, i.e. (f\g) = F • G for any two vectors |/), \g) of 
H (we write (-|-) for the scalar product in TL.) 

3.2 The Fourier Transform Matrices basis 

We shall now construct the observable O' as specified at the beginning of this 
section. 

An observable in Ti is identified by its MN eigenstates that form an or- 
thogonal basis, or, using the isomorphism ip, by MN orthogonal matrices in 
Ai(M,N). We propose to take the MN two-parameter matrices F Q/ g with 
a = 0, 1, . . . , M — 1 and (3 = 0, 1, . . . , N — 1 whose entries are defined by 

(F a/J ) = — e i&<xmi^0n /« 

v ^ )m - n Vmn { ' 



1 For convenience we write Fq.o instead of Fi i for the upper left element of matrix F. 



We shall call the above matrices Fourier Transform Matrices (FTM). We 
recall that given a matrix A <E M. (M, N) , its two dimensional Discrete Fourier 
Transform A e M(M,N) is defined as 



(*) 



1 

<*,P y/MN 



A) n = -^= J2 (A) m ,„ e-^^e-W" = Tr F^A = ¥ a , p 



Therefore the components of A on the FTM basis are the entries of its Discrete 
Fourier Transform A. 

We still have to decide which eigenvectors are to be taken as an indication of 
the function / being constant. In other words we have fixed the basis £ but have 
yet to choose the subset C. We take C as composed by the M— 1 matrices F Qi o, 
with a = 1, 2, . . . , M — 1. It is easy to check that C U {Fo,o} spans the subspace 
of A4(M,N) generated by the set of the M matrices {f{\f))\f constant} cor- 
responding to the constant functions. We have not included Fo.o in C because 
the projection probability of the computer's final state on Fo,o is the same for 
all functions: 

F ,o • F = Tr (Fj i0 F) = -±= VF e M(M, N). (8) 

^ V ■£■'■*■ 

Therefore Fo,o has the same role that state |FAIL) had in Deutsch's example. In 
the following we shall put (p^ 1 (Fo.o) = |FAIL) and we shall speak equivalently 
of the matrix F ,o in M(M, N) or of the state |FAIL> in H. 

Likewise, since Fo,^ • F = for all (3 ^ and every matrix F, subset 

terror = {F „3|l</3<^-l} 

plays the role of the |ERROR) state in Deutsch's example (section EJ. 
The remaining FTM matrices constitute set C: 

C = {F Qi/3 |l <a<M-l,l</3<AT-l}. 

Note that we did not put a prime on C, since it does satisfy both conditions i. 
and ii. listed in section [21 This accounts for the lack of symmetry we pointed 
out at the beginning of section |21 

4 Efficiency of the probabilistic generalization 

Suppose we run the quantum algorithm k times on the same function / and we 
always get an indication that / is constant (a projection onto C). We need to 
gauge the reliability of this result, which we can do by computing the posterior 
probability 

Pr(rcally constant| k "constant" outcomes) (9) 

(Pr(const| k) for short) that the function really is constant. This quantity 
can also be used to compare the efficiency of the quantum algorithm against 



a conventional classical solution, since what we are looking for is a procedure 
giving the lowest probability of error in change for the same computational 
effort. 

To evaluate © we use Bayes' theorem, that is 

_ , , , . Pr(const A k) , „. 

Pr(const| k) = l pr(fc) ' , (10) 

where by Pr(const A k) we mean the joint probability that / is constant and 
that k runs of the algorithm yield a "constant" outcome, corresponding to the 
final state being projected along C By the product rule, this can be expressed 

as 

Pr(const A k) = Pr(const) Pr(fc|const). (11) 

Assuming a uniform probability distribution on all the possible functions, wc 
have Pr(const) = M/M N . Regardless of the computed function /, FAIL re- 
sults have a 1/M probability of showing up (see equation [SJ . This leads to 
Pr(fc|const) = (1 — 1/M) . As a consequence (|ll(l becomes 

Pr (const Afc) = M 1 ~ N (l - — J . (12) 

The denominator of l|l(J|) can be expanded over all the M N possible functions 
of type (0: 

M N 

Pr(k) = Y,Pr(f i )Pr(k\f i ). (13) 

i=\ 

Since we assumed the input functions to be uniformly distributed, we have 

Pv(f i )=M~ N Vi (14) 

The k runs of the quantum algorithm are stochastically independent and that 
implies that the likelihoods Pr(fc|/) appearing in (|13(l are simply given by 

Pr(fc|/)=[Pr(C'|/)] fc , (15) 

where with Pr(C |/) we mean the likelihood of a single run, the probability of 
a projection onto C when the function is /. So we can concentrate only on 
Pr(C |/), which, with the help of the sum rule, can be expressed as 

Af-l 

Pr(C'|/) = Pr(*|/) - Pr(FAIL|/) = ^ Pr(^,|/) - Pr(FAIL|/). 

a =o (16) 

Here R stays for event "after the measure the computer's final state projects 
itself onto the subspace of all constant functions" , & a for the projection onto 



the matrix K Q , which represents the a-th constant function and is defined as 
(^a)mn = ^m,a, and FAIL for the projection onto |FAIL). We have used 
the fact that, thanks to the orthogonality relations, events &o,&i,... ,&m-i 
are mutually exclusive and so are C and FAIL and that C U FAIL = & and 

U Q= o &a = & 

As we have seen in (JHJ, Pr (FAIL|/) = 1/M V /. On the other hand 



where 



Pr(jy/) = |K 



/ /o,o 
/1.0 



|Tr(KiF) 



(17) 



/o,i 
fi.i 



\f- 



M-1,0 



I 



M-1,1 



fo,N-l \ 
fl,N-l 



Im-i.n-i J 



(18) 



is the matrix related through isomorphism <p to the computer's final state when 
the function is / (Note that most of the elements of F are zero, since f m .n = 



■./(«) 



Let us now compute explicitly the trace that appears in the r.h.s. of 117(1 : 

Ir (r\ Q rJ = / X a lm,nV)m,n = ~TJ / J O m ,alm,n = ~Z} / < Ja,n- 



Equation l|17|l then becomes 



Pr(£«|/) = 



N 2 



N-l 



/ j Ja,n 



n=0 



(19) 



(20) 



Equation (|19fl contains the sum of the elements appearing in the a-th row of 
matrix (|18|l . Since matrix F has a sole one in any column, this sum is equivalent 
to the number of ones in the a-th row of F. This gives us an idea for a smart 
classification of all the possible functions appearing in H13[): we associate with 
every function an N + 1-uple (jo, ji, ■ ■ ■ , Jn), where ji is the number of rows of 
its corresponding matrix F with I ones and N — I zeroes. Doing so we can replace 
the sum over i appearing in (|13fl by a sum over the N + 1-uples (Jo,ji, ■ ■ ■ , Jn), 
with conditions 



< ji < M V /, 

h + 2J2 + --- + Nj N = N, 
Jo +.7i H 1- Jn = M. 



(21) 

(22) 
(23) 



Condition ill' 2 1) expresses the requirement that the total number of ones in matrix 
F is N (or, since each column contains a sole one, that F has 7Y columns), 
while l|23(l is equivalent to the condition that F has M rows. In the following, 



we shall indicate with X the set of the N + 1-uplcs (jo,ji, ■ ■ ■ ,jn) that satisfy 
equations lj?Tll - (|25j) . 

Note that, since every N + 1-uple corresponds to more than one function, 
when summing over the N + 1-uples we must use the right combinatorial factors. 
These, for a fixed N + 1-uple (Jo,ji, ... ,Jn)i are given by: 



C 



N\ 



Ml 



30,3l,--- ,3N 



(o\y° (ll) J1 ■ ■ ■ (NI) JN join! ■■■jnV 



(24) 



the first term corresponding to column permutations and the second to row 
permutations. 

We can now use equation (|2*n|) together with this way of classifying the 
functions to evaluate the total likelihood Pr (i\\f) that appears as the first term 
in equation (|16[) . If fj Q j 1 ,... ,j N stands for a function corresponding to the N+ 1- 
uple (io,ii,... ,Jn), 

M-l N 

a=0 1=0 

Consequently l|16[l becomes 

1 N 1 

and (|15|l becomes in turn 



/=o 



Pr(k\fjo,h, 



1 N 



N 2 



/=o 



2_J_ 

M 



Now we can sum over all the possible N + 1-uples with conditions (|21[l - 
l|23|) and with the combinatorial factors (|24|l . obtaining the expression of equa- 
tion 1)13(1 in the quantum case: 



Pr(fc) = £ 



(jodii— ,Jn)&1 



M N 



N "1 fe 



iV 2 



/=o 



C 



JCJlj — JJV 



(N.B. we have used the fact that Pr(/) = M~ N V /, since we suppose a prior 
uniform probability distribution on all the functions). 

Finally, using also equation l|12l) we can express the posterior probability l|10[l 
in the quantum case as 



Pr(constjfc) 



^ (!-£)' 



b'oj'i,--- ,iiv)e2: 



iV 



1 v^" /■/ ■ \ 2 _ 1 



C 



30 ,]!,-■■ ,3N 



(25) 



In the following section we will derive the corresponding expression for the 
classical case. 
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5 Efficiency of the "sampling" algorithm 

There exists at least one obvious classical probabilistic algorithm that can be 
used to spot constant functions. We can simply compute the value of 

f:{Q,...,N-l}->{0,...,M-l} 

on k randomly chosen points of its domain and decide that / is constant if its 
restriction to the sampled points is. This procedure, which we shall call the 
"sampling algorithm" , evidently constitutes the best possible classical strategy 
to solve the problem, since it uses up all the information we can gain on / by k 
classical computations. 

In order to allow a direct comparison with the quantum algorithm, we have 
to find out what the posterior probabilities Pr(const|fc) are in this case. Starting 
again from Bayes' theorem, we can express the numerator of equation 1)10(1 as 

Pr(const A k) = Pr(const)Pr(/c|const) = Pr(const), 

that by l)14[) is equal to M/M N = M 1_N (note that in the classical case 
Pr(/c|const) = 1, since no FAIL results exist). 

We must now evaluate the denominator of Bayes' formula, namely equa- 
tion (|13|l . Choosing the k inputs at random actually turns out to be inessential 
as long as the functions are uniformly distributed: sampling the first k points 
0, 1, ..., k — 1 is just as good. Let us therefore divide all the possible functions 
into two classes. The first is made up by those for which at least the first k 
values are constant; they are MM N ~ k = M +1 . All the other functions 
belong to the second class. As a consequence, the likelihoods that appear in the 
r.h.s. of Ijl3(l arc simply given by 

,,, ., _ J 1 if / belongs to the first class 
r ^ ' ' ~ \ otherwise 

Putting this expression in 1)13(1 . and recalling l|14|l. we can rewrite (1101) as 

M l - N 
Pr(const|fc) = = M k ~ N . (26) 

M N-k+l/ M N 

This result is to be compared with equation 1)25)1 ■ which gives the corre- 
sponding posterior probability after k runs of the quantum algorithm. In order 
to do so, formula (|25[) must evidently be evaluated by means of a (classical!) 
computer. Before listing the numerical results, however, we are going to discuss 
two special cases that can be solved analytically in the limit of large N. 

6 Worst case and best case analysis 

We shall now analyse the behaviour of our generalized quantum algorithm in 
the worst possible case, that is when the computed function has maximum 



11 



probability of being mistaken for a constant function, even if it is not. 
occurs quite naturally for a matrix of the following kind: 



This 



A ; 



/111 



V 



1 ° M 

1 




/J 



>M 



representing a function g that is constant on its whole domain but for one point. 
The resulting probability of error is given by the squared modulus of the 
projection of G on the space spanned by the set {Ko, . . . , Kj\/-i} of the matrices 
associated to constant functions q that is 



Pr, 



1 



2 

N 



2 

7V2' 



Therefore Pr e tends to one in the limit of large N. Here again, in order to 
compensate for this we have to run the quantum algorithm several times, say k 
(classically, we would have to sample more and more points). If we want to keep 
the probability of being "cheated" by an almost-constant function g as low as 
a given value e, we evidently have to choose k so that (Pr_g) = e, that is 

lne 



ln(l-A + 1 | T )- 

As we would expect, k does tend to infinity in the limit of large N, meaning 
that exploring an even larger domain requires an infinite number of computa- 
tions. It is nevertheless interesting to study the ratio ij = k/N of the number 
of runs to the number of elements in the domain. In the limit of large N, this 
becomes 



k lne 
hm ri = lim — = , 

JV-.00 AT^oo N 2 



(27) 



which is a constant independent on N. 

Therefore, if we are required to perform the computation with a worst case 
error probability e, we have to run our quantum computer a number of times 
which, in the limit of large N, is a definite fraction r/ of N. Equation (|27|l can 
in this case be inverted to obtain e as a function of rj, yielding e = e~ 2 ''. 

Coming now to the classical case, sampling a fraction r\ of the points in 
the domain (which requires k = rjN computations) entails having a probability 
e = 1 — T) of mistaking g for /o- 

In figure ^ we plotted the worst case probability of error against r\ for both 
the quantum and the classical algorithm in the limit of large N. In the quantum 



We assume that M is so large that the probability of a FAIL result is negligible. 
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Figure 1: Worst case probability of error e versus the ratio rj — k/N of the 
number of computations to the number of points in the domain. Solid line: 
quantum algorithm; dashed line: classical "sampling" algorithm. 

case e decreases more rapidly and stays well below the classical probability of 
error as long as r\ is not too close to 1 (remember that the "sampling" algorithm 
is no longer probabilistic if we compute our function over its entire domain!). 

Looking now at the best case, we find that there is again a single class 
of functions which is easily dealt with by both algorithms, that is one-to-one 
functions or permutations of the points in the domain (this obviously requires 
N to be equal to M). 

Using the classical sampling algorithm, one can evidently be sure to distin- 
guish an invertible function from a constant one with only two computations, 
since the former does not assume any value in its range more than once. 

In the quantum case, permutations are associated with matrices having ex- 
actly one "1" in each row and in each column. Such matrices turn out to be 
orthogonal to C. Therefore, a measurement of the final state yielded by a per- 
mutation can either result in a FAIL or in projection along C, which indicates 
that the function is not constant. Now FAILs can only be obtained with prob- 
ability 1/N, which luckily vanishes as N grows larger. We conclude that, in 
the limit of large N, the quantum algorithm is practically guaranteed to spot 
a one-to-one function at first sight, after a single computation, thus doubling 
the efficiency of the classical algorithm. 

By the way, we note that if we only had to tell constant functions from 
permutations — if our practical problem didn't require us to deal with non- 
invertible, non-constant functions — we would be back to the original situation 
of Deutsch's example. We can now see what was so special about the four func- 
tions considered by Deutsch in his original example (see equation[2J ■ When both 
the domain and the range consist of two points only all non-constant functions 
turn out to be one-to-one, so that all ambiguity is removed. 
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7 Numerical Results 

We are including, in figures [5] through [3] some comparative graphics of the 
posterior probabilities Pr(const|fc) expressed by equations l|26|) and i|25|) versus 
the number k of successful computations effected (by successful computation we 
mean all computations barring FAIL results). 

As our previous analysis suggested, the quantum algorithm turns out to 
be far more efficient than the classical "sampling" algorithm for small values 
of k/N . We emphasize that this result is entirely dependent upon the use of 
quantum parallelism. This highly non-classical feature of quantum computation 
apparently allows a quicker exploration of the domain of function /, even in the 
case that the investigated property is not QPC. 

The posterior probability Pr(const|fc) we used for our numerical calculations 
is conditoned to a sequence of k "constant results" of the quantum algorithm. 
We have overlooked the possibility of obtaining one or more FAIL outcomes. 
This is particularly significant when M = 2 (figures [3 and 01 , because in such 
cases a FAIL result has a 1/2 probability to show up. This means that in order 
to obtain k projections of the final state of the computer along C one must 
expect to run the quantum computer Ik times. Nervertheless, as the graphics 
show, the quantum strategy always turns out to be convenient, at least for small 
values of k/N. 

We finally note that as N and M grow larger (see for instance figure EJ 
the resulting posterior probabilities turn out to be so low that both the quan- 
tum and the classical algorithm are virtually useless. This entirely depends on 
our assumption of an uniform distribution over functions /, which is probably 
eccessively penalizing. In real-world situations, we can expect the quantum 
algorithm to be useful in any situation in which the "sampling" algorithm is 
successfully employed at the present day. 
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Figure 3: Posterior probability Pr(const|fc) versus k (N = 16; M = 2; log scale). 
Boxes: quantum algorithm; crosses: classical "sampling" algorithm. 




Figure 4: Posterior probability Pr(const|fc) versus k (N = 16; M = 8; log scale). 
Boxes: quantum algorithm; crosses: classical "sampling" algorithm. 
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Figure 5: Posterior probability Pr(const|£;) versus k (N — 24; M = 24; 
log scale). Boxes: quantum algorithm; crosses: classical "sampling" algorithm. 



17 



